AITopics | transition operator

Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net

Neural Information Processing SystemsMar-17-2026, 14:04:02 GMT

We propose a novel method to {\it directly} learn a stochastic transition operator whose repeated application provides generated samples. Traditional undirected graphical models approach this problem indirectly by learning a Markov chain model whose stationary distribution obeys detailed balance with respect to a parameterized energy function. The energy function is then modified so the model and data distributions match, with no guarantee on the number of steps required for the Markov chain to converge. Moreover, the detailed balance condition is highly restrictive: energy based models corresponding to neural networks must have symmetric weights, unlike biological neural circuits. In contrast, we develop a method for directly learning arbitrarily parameterized transition operators capable of expressing non-equilibrium stationary distributions that violate detailed balance, thereby enabling us to learn more biologically plausible asymmetric neural networks and more general non-energy based dynamical systems. The proposed training objective, which we derive via principled variational methods, encourages the transition operator to walk back (prefer to revert its steps) in multi-step trajectories that start at data-points, as quickly as possible back to the original data points. We present a series of experimental results illustrating the soundness of the proposed approach, Variational Walkback (VW), on the MNIST, CIFAR-10, SVHN and CelebA datasets, demonstrating superior samples compared to earlier attempts to learn a transition operator. We also show that although each rapid training trajectory is limited to a finite but variable number of steps, our transition operator continues to generate good samples well past the length of such trajectories, thereby demonstrating the match of its non-equilibrium stationary distribution to the data distribution.

artificial intelligence, machine learning, proceedings, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)

Add feedback

7749f9c0d5ff109231be21e910a3ced2-Paper-Conference.pdf

Neural Information Processing SystemsFeb-14-2026, 22:52:57 GMT

artificial intelligence, deep learning, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > United States > Wisconsin > Dane County > Madison (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Iceland > Capital Region > Reykjavik (0.04)

Genre:

Instructional Material (0.67)
Research Report (0.46)

Industry: Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net

Neural Information Processing SystemsNov-21-2025, 15:06:26 GMT

We propose a novel method to {\it directly} learn a stochastic transition operator whose repeated application provides generated samples. Traditional undirected graphical models approach this problem indirectly by learning a Markov chain model whose stationary distribution obeys detailed balance with respect to a parameterized energy function. The energy function is then modified so the model and data distributions match, with no guarantee on the number of steps required for the Markov chain to converge. Moreover, the detailed balance condition is highly restrictive: energy based models corresponding to neural networks must have symmetric weights, unlike biological neural circuits. In contrast, we develop a method for directly learning arbitrarily parameterized transition operators capable of expressing non-equilibrium stationary distributions that violate detailed balance, thereby enabling us to learn more biologically plausible asymmetric neural networks and more general non-energy based dynamical systems. The proposed training objective, which we derive via principled variational methods, encourages the transition operator to walk back (prefer to revert its steps) in multi-step trajectories that start at data-points, as quickly as possible back to the original data points. We present a series of experimental results illustrating the soundness of the proposed approach, Variational Walkback (VW), on the MNIST, CIFAR-10, SVHN and CelebA datasets, demonstrating superior samples compared to earlier attempts to learn a transition operator. We also show that although each rapid training trajectory is limited to a finite but variable number of steps, our transition operator continues to generate good samples well past the length of such trajectories, thereby demonstrating the match of its non-equilibrium stationary distribution to the data distribution.

name change, transition operator, variational walkback, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)

Add feedback

Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net

Anirudh Goyal ALIAS PARTH GOYAL, Nan Rosemary Ke, Surya Ganguli, Yoshua Bengio

Neural Information Processing SystemsNov-21-2025, 08:13:35 GMT

Moreover, the detailed balance condition is highly restrictive: energy based models corresponding to neural networks must have symmetric weights, unlike biological neural circuits.

artificial intelligence, deep learning, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Workflow (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)

Add feedback

GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

Alex M. Lamb, Devon Hjelm, Yaroslav Ganin, Joseph Paul Cohen, Aaron C. Courville, Yoshua Bengio

Neural Information Processing SystemsNov-21-2025, 06:38:01 GMT

GibbsNet is the best of both worlds both in theory and in practice.

Add feedback

Compensating Distribution Drifts in Class-incremental Learning of Pre-trained Vision Transformers

Rao, Xuan, Xu, Simian, Li, Zheng, Zhao, Bo, Liu, Derong, Ha, Mingming, Alippi, Cesare

arXiv.org Artificial IntelligenceNov-14-2025

Recent advances have shown that sequential fine-tuning (SeqFT) of pre-trained vision transformers (ViTs), followed by classifier refinement using approximate distributions of class features, can be an effective strategy for class-incremental learning (CIL). However, this approach is susceptible to distribution drift, caused by the sequential optimization of shared backbone parameters. This results in a mismatch between the distributions of the previously learned classes and that of the updater model, ultimately degrading the effectiveness of classifier performance over time. To address this issue, we introduce a latent space transition operator and propose Sequential Learning with Drift Compensation (SLDC). SLDC aims to align feature distributions across tasks to mitigate the impact of drift. First, we present a linear variant of SLDC, which learns a linear operator by solving a regularized least-squares problem that maps features before and after fine-tuning. Next, we extend this with a weakly nonlinear SLDC variant, which assumes that the ideal transition operator lies between purely linear and fully nonlinear transformations. This is implemented using learnable, weakly nonlinear mappings that balance flexibility and generalization. To further reduce representation drift, we apply knowledge distillation (KD) in both algorithmic variants. Extensive experiments on standard CIL benchmarks demonstrate that SLDC significantly improves the performance of SeqFT. Notably, by combining KD to address representation drift with SLDC to compensate distribution drift, SeqFT achieves performance comparable to joint training across all evaluated datasets. Code: https://github.com/raoxuan98-hash/sldc.git.

artificial intelligence, dataset, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2511.09926

Country: Asia > China (0.28)

Genre: Research Report > New Finding (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

7749f9c0d5ff109231be21e910a3ced2-Paper-Conference.pdf

Neural Information Processing SystemsOct-11-2025, 00:50:43 GMT

artificial intelligence, deep learning, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > United States > Wisconsin > Dane County > Madison (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Iceland > Capital Region > Reykjavik (0.04)

Genre:

Instructional Material (0.67)
Research Report (0.46)

Industry: Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

QSearchNet: A Quantum Walk Search Framework for Link Prediction

Dubey, Priyank

arXiv.org Artificial IntelligenceOct-2-2025

Link prediction is one of the fundamental problems in graph theory, critical for understanding and forecasting the evolution of complex systems like social and biological networks. While classical heuristics capture certain aspects of graph topology, they often struggle to optimally integrate local and global structural information or adapt to complex dependencies. Quantum computing offers a powerful alternative by leveraging superposition for simultaneous multi-path exploration and interference-driven integration of both local and global graph features. In this work, we introduce QSearchNet, a quantum-inspired framework based on Discrete-Time Quantum Walk (DTQW) dynamics and Grover's amplitude amplification. QSearchNet simulates a topology-aware quantum evolution to propagate amplitudes across multiple nodes simultaneously. By aligning interference patterns through quantum reflection and oracle-like phase-flip operation, it adaptively prioritizes multi-hop dependencies and amplifies structurally relevant paths corresponding to potential connections. Experiments on diverse real-world networks demonstrate competitive performance, particularly with hard negative samples under realistic evaluation conditions.

data mining, machine learning, qsearchnet, (18 more...)

arXiv.org Artificial Intelligence

2510.00325

Country: North America > United States > Texas (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback

paper

Akshay Krishnamurthy

Neural Information Processing SystemsAug-17-2025, 02:07:36 GMT

algorithm, inequality, probability, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

Reviews: GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

Neural Information Processing SystemsOct-7-2024, 17:00:52 GMT

This paper presents GibbsNet, a deep generative model formulated as transition operators. The transition operators are learned in an adversarial way, similar to that of the adversarially learned inference (ALI). However instead of using a fixed prior p(z), GibbsNet does not require the specification of a particular prior, but rather learn a prior implicitly. Training is done by unrolling the sampling process multiple times and doing adversarial learning to match the sampling distribution to the one clamped from data and doing posterior only once. When unrolling for only one step GibbsNet becomes equivalent to ALI.

deep graphical model, gibbsnet, iterative adversarial inference, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.76)

Add feedback

Filters

Collaborating Authors

transition operator

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net

7749f9c0d5ff109231be21e910a3ced2-Paper-Conference.pdf

Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net

Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net

GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

Compensating Distribution Drifts in Class-incremental Learning of Pre-trained Vision Transformers

7749f9c0d5ff109231be21e910a3ced2-Paper-Conference.pdf

QSearchNet: A Quantum Walk Search Framework for Link Prediction

paper

Reviews: GibbsNet: Iterative Adversarial Inference for Deep Graphical Models